
    Error-Correcting Tournaments

    We present a family of pairwise tournaments reducing k-class classification to binary classification. These reductions are provably robust against a constant fraction of binary errors. The results improve on the PECOC construction \cite{SECOC} with an exponential improvement in computation, from O(k) to O(log₂ k), and the removal of a square root in the regret dependence, matching the best possible computation and regret up to a constant.
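    The abstract does not spell out the tournament structure, but the core idea of replacing an O(k) one-against-all style reduction with a logarithmic-depth schedule of pairwise binary decisions can be sketched as below; the balanced halving, the choose_right stub and the toy usage are illustrative assumptions, not the paper's construction.

```python
# Minimal sketch (not the paper's exact construction): route a k-class
# prediction through a balanced binary tree of binary decisions, so one
# example needs O(log2 k) classifier evaluations instead of O(k).
# The per-node binary classifiers are assumed to be given via choose_right.

def tree_predict(classes, choose_right, x):
    """classes: ordered list of candidate labels;
    choose_right(left_labels, right_labels, x) -> True to descend right."""
    labels = list(classes)
    while len(labels) > 1:
        mid = len(labels) // 2
        left, right = labels[:mid], labels[mid:]
        labels = right if choose_right(left, right, x) else left
    return labels[0]

# toy usage: a stand-in "classifier" that sends x toward the half containing it
decide = lambda left, right, x: x in right
print(tree_predict(range(8), decide, 5))  # -> 5
```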

    CSNL: A cost-sensitive non-linear decision tree algorithm

    This article presents a new decision tree learning algorithm called CSNL that induces Cost-Sensitive Non-Linear decision trees. The algorithm is based on the hypothesis that non-linear decision nodes provide a better basis than axis-parallel decision nodes, and it utilizes discriminant analysis to construct non-linear decision trees that take account of misclassification costs. The performance of the algorithm is evaluated by applying it to seventeen datasets, and the results are compared with those obtained by two well-known cost-sensitive algorithms, ICET and MetaCost, which generate multiple trees to obtain some of the best results to date. The results show that CSNL performs at least as well as, if not better than, these algorithms on more than twelve of the datasets and is considerably faster. The use of bagging with CSNL further enhances its performance, showing the significant benefits of using non-linear decision nodes.
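    As a rough illustration of the ingredient the abstract emphasizes, discriminant-analysis decision nodes combined with misclassification costs, the sketch below builds a single quadratic-discriminant node and picks the label with minimum expected cost; the cost matrix and dataset are invented, and this is not the CSNL tree-induction procedure.

```python
# Minimal illustration (not the CSNL induction algorithm itself): a single
# non-linear decision node built with quadratic discriminant analysis,
# where the label is chosen to minimize expected misclassification cost
# rather than to maximize probability. Cost matrix and data are made up.
import numpy as np
from sklearn.datasets import make_moons
from sklearn.discriminant_analysis import QuadraticDiscriminantAnalysis

X, y = make_moons(n_samples=400, noise=0.25, random_state=0)

# cost[i, j] = cost of predicting class j when the true class is i;
# here, missing class 1 is five times as costly as missing class 0
cost = np.array([[0.0, 1.0],
                 [5.0, 0.0]])

qda = QuadraticDiscriminantAnalysis().fit(X, y)       # non-linear node
expected_cost = qda.predict_proba(X) @ cost           # shape (n_samples, 2)
cost_sensitive_pred = expected_cost.argmin(axis=1)    # minimum expected cost

def avg_cost(pred):
    """Average misclassification cost actually incurred by predictions."""
    return cost[y, pred].mean()

print("avg cost, most-probable rule   :", avg_cost(qda.predict(X)))
print("avg cost, minimum-expected-cost:", avg_cost(cost_sensitive_pred))
```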

    Max-Margin Dictionary Learning for Multiclass Image Categorization

    Visual dictionary learning and base (binary) classifier training are two basic problems for the recently most popular image categorization framework, which is based on bag-of-visual-terms (BOV) models and multiclass SVM classifiers. In this paper, we study new algorithms to improve the performance of this framework in these two aspects. Typically, SVM classifiers are trained with dictionaries fixed, and as a result the traditional loss function can only be minimized with respect to the hyperplane parameters (w and b). We propose a novel loss function for a binary classifier which links the hinge-loss term with dictionary learning. By doing so, we can further optimize the loss function with respect to the dictionary parameters. Thus, this framework is able to further increase the margins of binary classifiers and consequently decrease the error bound of the aggregated classifier. On two benchmark datasets…
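    For context, a minimal sketch of the fixed-dictionary baseline the abstract starts from is given below: local descriptors are quantized against a k-means visual dictionary into BOV histograms, and a hinge-loss SVM is trained on top. The paper's contribution, optimizing the dictionary itself against the same hinge loss, is not reproduced here; the synthetic descriptors, labels and sizes are assumptions.

```python
# Baseline pipeline only (dictionary held fixed while the SVM is trained);
# the paper's joint dictionary/classifier optimization is not shown.
import numpy as np
from sklearn.cluster import KMeans
from sklearn.svm import LinearSVC

rng = np.random.default_rng(0)
n_images, descr_per_image, descr_dim, dict_size = 60, 50, 16, 32

# synthetic "local descriptors" for each image, plus binary image labels
descriptors = rng.normal(size=(n_images, descr_per_image, descr_dim))
labels = rng.integers(0, 2, size=n_images)

# learn the visual dictionary from all descriptors pooled together
dictionary = KMeans(n_clusters=dict_size, n_init=10, random_state=0)
dictionary.fit(descriptors.reshape(-1, descr_dim))

def bov_histogram(img_descriptors):
    """Encode one image as a normalized histogram of visual-term counts."""
    words = dictionary.predict(img_descriptors)
    hist = np.bincount(words, minlength=dict_size).astype(float)
    return hist / hist.sum()

X = np.array([bov_histogram(d) for d in descriptors])
svm = LinearSVC(C=1.0, loss="hinge", dual=True).fit(X, labels)  # hinge loss
print("training accuracy with the dictionary fixed:", svm.score(X, labels))
```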

    Building multiclass classifiers for remote homology detection and fold recognition

    BACKGROUND: Protein remote homology detection and fold recognition are central problems in computational biology. Supervised learning algorithms based on support vector machines are currently among the most effective methods for solving these problems. These methods are primarily used to solve binary classification problems and have not been extensively used to solve the more general multiclass remote homology prediction and fold recognition problems. RESULTS: We present a comprehensive evaluation of a number of methods for building SVM-based multiclass classification schemes in the context of the SCOP protein classification. These methods include schemes that directly build an SVM-based multiclass model, schemes that employ a second-level learning approach to combine the predictions generated by a set of binary SVM-based classifiers, and schemes that build and combine binary classifiers for various levels of the SCOP hierarchy beyond those defining the target classes. CONCLUSION: Analyzing the performance achieved by the different approaches on four different datasets, we show that most of the proposed multiclass SVM-based classification approaches are quite effective in solving the remote homology prediction and fold recognition problems, and that the schemes that use predictions from binary models constructed for ancestral categories within the SCOP hierarchy tend not only to lead to lower error rates but also to reduce the number of errors in which a superfamily is assigned to an entirely different fold and a fold is predicted as being from a different SCOP class. Our results also show that the limited size of the training data makes it hard to learn complex second-level models, and that models of moderate complexity lead to consistently better results.
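    A minimal sketch of one of the scheme families evaluated above, binary one-vs-rest SVMs combined by a second-level learner, might look as follows; the SCOP data, the profile- or kernel-based features, and the hierarchy-derived classifiers are not reproduced, and a small built-in dataset stands in.

```python
# Sketch of a two-level scheme: binary one-vs-rest SVMs whose decision
# values are combined by a simple second-level learner (logistic
# regression). In practice the second-level inputs would come from
# cross-validated predictions; here they are taken from the training
# fit purely for illustration.
from sklearn.datasets import load_digits
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.multiclass import OneVsRestClassifier
from sklearn.svm import LinearSVC

X, y = load_digits(return_X_y=True)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

# first level: one binary SVM per class
binary_svms = OneVsRestClassifier(LinearSVC(C=1.0, dual=True)).fit(X_tr, y_tr)

# second level: learn how to combine the per-class decision values
meta = LogisticRegression(max_iter=1000)
meta.fit(binary_svms.decision_function(X_tr), y_tr)

print("one-vs-rest alone :", binary_svms.score(X_te, y_te))
print("with second level :", meta.score(binary_svms.decision_function(X_te), y_te))
```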

    Multiclass classification of microarray data samples with a reduced number of genes

    Background: Multiclass classification of microarray data samples with a reduced number of genes is a rich and challenging problem in Bioinformatics research. The problem gets harder as the number of classes is increased. In addition, the performance of most classifiers is tightly linked to the effectiveness of mandatory gene selection methods. Critical to gene selection is the availability of estimates about the maximum number of genes that can be handled by any classification algorithm. Lack of such estimates may lead to either computationally demanding explorations of a search space with thousands of dimensions or classification models based on gene sets of unrestricted size. In the former case, unbiased but possibly overfitted classification models may arise. In the latter case, biased classification models unable to support statistically significant findings may be obtained. Results: A novel bound on the maximum number of genes that can be handled by binary classifiers in binary mediated multiclass classification algorithms of microarray data samples is presented. The bound suggests that high-dimensional binary output domains might favor the existence of accurate and sparse binary mediated multiclass classifiers for microarray data samples. Conclusions: A comprehensive experimental work shows that the bound is indeed useful to induce accurate and sparse multiclass classifiers for microarray data samples.
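    The bound itself is not stated in the abstract, so the sketch below only illustrates the setting it applies to: each binary classifier inside a binary-mediated (here one-vs-rest) multiclass scheme is restricted to a small, fixed number of genes. The cap MAX_GENES, the recursive feature elimination step and the synthetic data are placeholders, not the paper's result.

```python
# Setting only: every binary sub-problem of a one-vs-rest multiclass
# classifier is restricted to a fixed number of genes (features).
# MAX_GENES is a placeholder for the bound derived in the paper.
from sklearn.datasets import make_classification
from sklearn.feature_selection import RFE
from sklearn.multiclass import OneVsRestClassifier
from sklearn.pipeline import make_pipeline
from sklearn.svm import LinearSVC

MAX_GENES = 20  # placeholder, not the paper's bound

# synthetic stand-in for a microarray dataset: few samples, many "genes"
X, y = make_classification(n_samples=120, n_features=2000, n_informative=40,
                           n_classes=4, n_clusters_per_class=1, random_state=0)

# each binary one-vs-rest problem gets its own sparse gene subset
sparse_binary = make_pipeline(
    RFE(LinearSVC(dual=True), n_features_to_select=MAX_GENES, step=0.2),
    LinearSVC(dual=True),
)
clf = OneVsRestClassifier(sparse_binary).fit(X, y)
print("training accuracy with", MAX_GENES, "genes per binary classifier:",
      clf.score(X, y))
```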

    Maximizing upgrading and downgrading margins for ordinal regression

    In ordinal regression, a score function and threshold values are sought to classify a set of objects into a set of ranked classes. Classifying an individual in a class with a higher (respectively lower) rank than its actual rank is called an upgrading (respectively downgrading) error. Since upgrading and downgrading errors may not have the same importance, they should be considered as two different criteria to be taken into account when measuring the quality of a classifier. In Support Vector Machines, margin maximization is used as an effective and computationally tractable surrogate for the minimization of misclassification errors. As an extension, we consider in this paper the maximization of the upgrading and downgrading margins as a surrogate for the minimization of upgrading and downgrading errors, and we address the biobjective problem of finding a classifier that maximizes the two margins simultaneously. The whole set of Pareto-optimal solutions of this biobjective problem is described as translations of the optimal solutions of a scalar optimization problem. For the most popular case, in which the Euclidean norm is considered, the scalar problem has a unique solution, implying that all the Pareto-optimal solutions of the biobjective problem are translations of each other. Hence, the Pareto-optimal solutions can easily be provided to the analyst, who, after inspecting the misclassification errors caused, should choose the most convenient classifier at a later stage. The consequence of this analysis is that it provides a theoretical foundation for a popular strategy among practitioners, based on the so-called ROC curve, which is shown here to equal the set of Pareto-optimal solutions of simultaneously maximizing the downgrading and upgrading margins.
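    As a toy numeric illustration of the trade-off (not the paper's biobjective optimization problem), the sketch below fixes a score function and thresholds and shows how translating the score shifts errors between the upgrading and the downgrading kind, tracing the sort of curve the abstract relates to the set of Pareto-optimal classifiers; the data and thresholds are invented.

```python
# Toy illustration: with a fixed score and thresholds, translating the
# score trades upgrading errors (predicted rank too high) against
# downgrading errors (predicted rank too low).
import numpy as np

rng = np.random.default_rng(0)
n = 300
thresholds = np.array([-1.0, 0.0, 1.0])                    # 4 ranked classes

true_rank = rng.integers(0, 4, size=n)
score = true_rank - 1.5 + rng.normal(scale=0.8, size=n)    # noisy latent score

def classify(score, shift):
    """Rank of the interval (between thresholds) a shifted score falls in."""
    return np.searchsorted(thresholds, score + shift)

for shift in (-0.6, -0.3, 0.0, 0.3, 0.6):
    pred = classify(score, shift)
    upgrading = int((pred > true_rank).sum())     # predicted rank too high
    downgrading = int((pred < true_rank).sum())   # predicted rank too low
    print(f"shift {shift:+.1f}: upgrading={upgrading:3d}  downgrading={downgrading:3d}")
```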